Self-Attention-Based Edge Computing Model for Synthesis Image to Text through Next-Generation AI Mechanism
Authors
Abstract
Image synthesis based on natural language description has become a research hotspot in edge computing and artificial intelligence. With the help of generative adversarial networks, the field has made great strides in high-resolution image synthesis. However, there are still some defects in the authenticity of synthetic single-target images. For example, abnormal situations such as “multiple heads” and “multiple mouths” occur when synthesizing bird graphics. To address these problems, a text-to-image generation model, SA-AttnGAN, based on the self-attention mechanism is proposed. SA-AttnGAN, which builds on AttnGAN (Attentional Generative Adversarial Network), refines text features into word features and sentence features to improve the semantic alignment of text and images; in the initialization stage of AttnGAN, the self-attention mechanism is used to improve the stability of the text-to-image generation model; and a multistage GAN network is superimposed to finally synthesize high-resolution images. Experimental data show that SA-AttnGAN outperforms other comparable models in terms of Inception Score and Frechet Inception Distance; analysis of the synthetic images shows that the model can learn background and colour information and correctly capture bird heads and mouths. The structural components are improved, whereas AttnGAN generates incorrect images with “multiple heads” and “multiple mouths.” Furthermore, SA-AttnGAN is successfully applied to description-based clothing image synthesis, with good generalization ability.
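The abstract does not say which form of self-attention SA-AttnGAN uses. The sketch below is therefore only an illustration of a generic residual self-attention step over image feature positions, in the spirit of self-attention GANs. The function names, the projection matrices w_q, w_k and w_v, and the mixing weight gamma are all hypothetical and are not taken from the paper.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def matmul(a, b):
    """Multiply an (n x k) matrix by a (k x m) matrix, both given as lists of lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def self_attention(features, w_q, w_k, w_v, gamma=0.0):
    """Residual self-attention over n feature positions with c channels each.

    features: n x c matrix (one row per spatial position).
    w_q, w_k: c x d projections (hypothetical); w_v: c x c projection.
    gamma:    weight of the attended features added back to the input.
    Returns (output, attention), where attention is n x n and row-stochastic.
    """
    q = matmul(features, w_q)
    k = matmul(features, w_k)
    v = matmul(features, w_v)
    k_t = [list(col) for col in zip(*k)]          # transpose keys to d x n
    attention = [softmax(r) for r in matmul(q, k_t)]
    attended = matmul(attention, v)               # each position mixes all positions
    output = [
        [f + gamma * a for f, a in zip(f_row, a_row)]
        for f_row, a_row in zip(features, attended)
    ]
    return output, attention


# Toy usage: three positions, two channels, identity projections.
feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
eye = [[1.0, 0.0], [0.0, 1.0]]
out, attn = self_attention(feats, eye, eye, eye, gamma=1.0)
```

With gamma set to 0 the layer passes its input through unchanged, so attention can be introduced gradually during training. That kind of gradual start is one plausible reading of how self-attention could help stability in the initialization stage, but the abstract itself does not confirm this design.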
Similar resources
Contourlet-Based Edge Extraction for Image Registration
Image registration is a crucial step in most image processing tasks for which the final result is achieved from a combination of various resources. In general, the majority of registration methods consist of the following four steps: feature extraction, feature matching, transform modeling, and finally image resampling. As the accuracy of a registration process is highly dependent on the fe...
Edge Model Based High Resolution Image Generation
The present paper proposes a new method for high resolution image generation from a single image. Generation of high resolution (HR) images from lower resolution image(s) is achieved by either reconstruction-based methods or by learning-based methods. Reconstruction based methods use multiple images of the same scene to gather the extra information needed for the HR. The learning-based methods ...
Improvement of generative adversarial networks for automatic text-to-image generation
This research concerns the use of deep learning tools and image processing technology for the automatic generation of images from text. Previous studies have used a single sentence to produce images. In this research, a memory-based hierarchical model is presented that uses three different descriptions, each given as a sentence, to produce and improve the image. The proposed ...
Text-Guided Attention Model for Image Captioning
Visual attention plays an important role in understanding images and has proved effective in generating natural language descriptions of images. On the other hand, recent studies show that language associated with an image can steer visual attention in the scene during our cognitive process. Inspired by this, we introduce a text-guided attention model for image captioning, which learns t...
Journal
Journal title: Mathematical Problems in Engineering
Year: 2022
ISSN: 1026-7077, 1563-5147, 1024-123X
DOI: https://doi.org/10.1155/2022/4973535